Back

Scientific Data

30 training papers 2019-06-25 – 2026-03-07

Top medRxiv preprints most likely to be published in this journal, ranked by match strength.

1
The Effects of External Laser Positioning Systems for MRI Simulation on Image Quality and Quantitative MRI Values
2026-03-07 radiology and imaging 10.64898/2026.03.06.26347809
#1 (5.0%)
Show abstract

Background and Purpose: Magnetic resonance imaging (MRI) for radiation therapy treatment planning is currently being used in many anatomical sites to better visualize soft tissue landmarks, a technique known as an MRI simulation. A core component of modern MRI simulation configurations are the use of external laser positioning systems (ELPS) to help set up the patient. Though necessary for accurate and reproducible patient setup, the ELPS, if left on during imaging, may interfere negatively with...

2
An agentic AI system enhances clinical detection of immunotherapy toxicities: a multi-phase validation study
2026-03-02 oncology 10.64898/2026.02.26.26347179
Top 0.1% (4.0%)
Show abstract

Immune-related adverse events (irAEs) affect up to 40% of patients receiving immune checkpoint inhibitors, yet their identification depends on laborious and inconsistent manual chart review. Here we developed and evaluated an agentic large language model system to extract the presence, temporality, severity grade, attribution, and certainty of six irAE types from clinical notes. Retrospectively (263 notes), the system achieved macro-averaged F1 of 0.92 for detection and 0.66 for multi-class seve...

3
Graph-Augmented Retrieval for Digital Evidence-Based Medical Synthesis: A Proof-of-Concept Study on Topology-Aware Mechanistic Narrative Generation
2026-02-19 health systems and quality improvement 10.64898/2026.02.18.26346545
Top 0.2% (3.8%)
Show abstract

BackgroundRetrieval-augmented generation (RAG) frameworks such as RAPID [1] have demonstrated that staged planning and retrieval grounding improve long-form text generation. However, most implementations remain similarity-driven and open-domain, lacking the epistemic safeguards required for biomedical synthesis, where mechanistic completeness, temporal governance, traceability, and explicit gap classification are essential. ObjectiveTo develop and evaluate a topology-aware, graph-augmented retr...

4
End-to-End PET/CT Interpretation and Quantification with an LLM-Orchestrated AI Agent: A Real-World Pilot Study
2026-02-25 radiology and imaging 10.64898/2026.02.21.26346798
Top 0.2% (3.6%)
Show abstract

BackgroundAlthough deep learning models have improved individual PET analysis, image processing and quantification tasks, end-to-end automation from raw DICOM to quantitative clinical reporting remains limited, particularly in heterogeneous real-world settings. MethodsAs a proof-of-concept, an autonomous large language model (LLM)-orchestrated multi-tool agent for end-to-end PET/CT interpretation was developed. A reasoning-based text LLM selected appropriate series from raw DICOM, coordinated r...

5
BUDAPEST: A Fast and Reliable Bayesian Algorithm for TMS Threshold Estimation with an Open-Source GUI and Human Validation
2026-03-04 radiology and imaging 10.64898/2026.03.03.26347528
Top 0.3% (3.0%)
Show abstract

BackgroundMotor threshold (MT) estimation is fundamental to transcranial magnetic stimulation (TMS), guiding individualized stimulation intensity in research and therapy. Conventional methods such as the 5-out-of-10 rule require many stimuli, while adaptive approaches like Parameter Estimation by Sequential Testing (PEST) improve efficiency but can exhibit poor convergence under certain conditions. ObjectiveThis study introduces the Bayesian Uncertainty Dynamic Algorithm for Parameter Estimatio...

6
UCSF RMaC: University of California San Francisco 3D Multi-Phase Renal Mass CT Dataset with Tumor Segmentations
2026-02-12 radiology and imaging 10.64898/2026.02.11.26346096
Top 0.4% (2.2%)
Show abstract

Current standard of care imaging practices cannot reliably differentiate among certain renal tumors such as benign oncocytoma and clear cell renal cell carcinoma (RCC), and between low and high grade RCCs. Previous work has explored using deep learning, radiomics, and texture analysis to predict renal tumor subtypes and differentiate between low and high grade RCCs with mixed success. To further this work, large diverse datasets are needed to improve model performance and provide strong evaluati...

7
On the assessment of deep-learning based super-resolution in small datasets of human brain MRI scans
2026-02-17 radiology and imaging 10.64898/2026.02.16.26346392
Top 0.4% (2.2%)
Show abstract

Deep-learning based super-resolution has shown promise for enhancing the spatial resolution of brain magnetic resonance images, which may help visualize small anatomical structures more clearly. However, when only limited training data are available, it remains uncertain which model assessment method provides the most reliable estimate of out-of-sample performance. In this study, three widely used assessment strategies (three-way holdout, k-fold cross-validation, and nested cross-validation) wer...

8
Auricular Muscle- controlled Navigation for Powered Wheelchairs
2026-03-03 rehabilitation medicine and physical therapy 10.64898/2026.02.28.26347311
Top 0.5% (2.2%)
Show abstract

There are many alternative methods to joystick control for control of Electric Powered Wheelchairs for users with neuromuscular disabilities, such as muscular dystrophy, and spinal cord injuries, such as tetraplegia. However, these methods- which include the sip-and-puff method, head and neck movement, blinking, or tongue movement- hinder social interaction, and are therefore detrimental to user independence. In recent years, research has explored the use of Electromyography (EMG) signals from a...

9
Quality versus quantity of training datasets for artificial intelligence-based whole liver segmentation
2026-02-18 radiology and imaging 10.64898/2026.02.17.26346486
Top 0.6% (2.1%)
Show abstract

Artificial intelligence (AI) based segmentation has many medical applications but limited curated datasets challenge model training; this study compares the impact of dataset annotation quality and quantity on whole liver AI segmentation performance. We obtained 3,089 abdominal computed tomography scans with whole-liver contours from MD Anderson Cancer Center (MDA) and a MICCAI challenge. A total of 249 scans were withheld for testing of which 30, MICCAI challenge data, were reserved for externa...

10
Image Quality Evaluation of Neonatal Brain MRI Using a Deep Learning Reconstruction Algorithm: A Quantitative and Multireader Study Using Variable Denoising Levels at 3 Tesla
2026-02-09 radiology and imaging 10.64898/2026.02.04.26345479
Top 0.6% (2.0%)
Show abstract

PurposeNeonatal imaging is particularly challenging because newborns have a high likelihood of head motion, which can degrade image quality and complicate interpretation. Improving MRI brain image quality may help reduce diagnostic uncertainty and facilitate the nuanced assessment of early myelinating structures in the neonatal brain. Although deep learning reconstruction algorithms designed to improve MRI image quality have been evaluated in pediatric imaging, they have not been specifically st...

11
Segmentation of metabolically relevant adipose tissue compartments and ectopic fat deposits
2026-02-27 radiology and imaging 10.64898/2026.02.25.26347069
Top 0.7% (2.0%)
Show abstract

Chemical shift-encoded magnetic resonance imaging using high-resolved 3D Dixon techniques enables the non-invasive and radiation-free assessment of whole-body adipose tissue and ectopic fat distribution. Automatic deep learning-based segmentation of metabolically relevant adipose tissue compartments and ectopic fat deposits in parenchymal tissue is the most important image processing step for the quantification of adipose tissue volumes and ectopic fat percentages from whole-body imaging. This ...

12
An Exploratory Study of ResNet and Capsule Neural Networks for Brain Tumor Detection in MRI
2026-02-09 radiology and imaging 10.64898/2026.02.05.26345460
Top 0.7% (2.0%)
Show abstract

Brain tumors are one of the most life-threatening diseases, requiring precise and timely detection for effective treatment. Traditional methods for brain tumor detection rely heavily on manual analysis of MRI scans, which is time-consuming, subjective, and prone to human error. With advancements in deep learning, Convolutional Neural Networks (CNNs) have become popular for medical image analysis. However, CNNs are limited in their ability to capture spatial hierarchies and pose variations, which...

13
The NLP-to-Expert Gap in Chest X-ray AI
2026-03-02 radiology and imaging 10.64898/2026.02.27.26347261
Top 0.8% (2.0%)
Show abstract

In previous work, we achieved state-of-the-art performance on ChestX-ray14 (ROC-AUC 0.940, F1 0.821) using pretraining diversity and clinical metric optimization. Applying the same methodology to CheXpert, we received similar results when using NLP valuation and test data--but when evaluated against expert radiologist labels, performance was only 0.75-0.87 ROC-AUC. The models had learned to match the automated NLP labeling system, not to diagnose disease. This paper documents our investigation ...

14
Deep Neural Patchworks Predict Renal Imaging Biomarkers from Non-Contrast MRI via Knowledge Transfer from Arterial-Phase Contrast-Enhanced MRI
2026-02-26 radiology and imaging 10.64898/2026.02.24.26346961
Top 0.9% (1.9%)
Show abstract

Rationale and ObjectivesContrast-enhanced (CE) MRI provides clear corticomedullary contrast for renal compartment delineation but may be contraindicated or undesirable in routine practice. We aimed to enable automated extraction of renal imaging biomarkers from routine non-contrast-enhanced (NCE) T1-weighted MRI by transferring CE-derived compartment labels. Materials and MethodsThis retrospective single-center study (January 2017 to December 2021) included 200 participants with paired arterial...

15
DBT-2026, a de-identified publicly available dataset of digital breast tomosynthesis exams with ground truth biopsies
2026-03-04 radiology and imaging 10.64898/2026.03.03.25337924
Top 0.9% (1.9%)
Show abstract

Digital breast tomosynthesis (DBT) is a powerful imaging modality that allows for improved lesion visibility, characterization, and localization compared to conventional two-dimensional digital mammography. DBT has been increasingly adopted in screening and diagnostic settings globally, particularly for women with dense breast tissue where tissue overlap presents a significant diagnostic challenge. Here we describe DBT-2026, a real world imaging dataset with 558 DBT exams from 558 patients with ...

16
Anatomically and Biochemically Guided Deep Image Prior for Sodium MRI Denoising
2026-03-02 health informatics 10.64898/2026.02.27.26347249
Top 1% (1.8%)
Show abstract

Sodium (23Na) magnetic resonance imaging (MRI) provides valuable metabolic information, but it is limited by a low signal-to-noise ratio (SNR) and long acquisition times. To overcome these challenges, we present a Deep Image Prior (DIP)-based framework that combines anatomically guided proton (1H) MRI and metabolically guided 23Na MRI denoising via a fused proton-sodium prior within a directional total variation (dTV) regularization scheme. The DIP-Fusion approach minimizes a variational loss fu...

17
Real-Time Detection of Breast Cancer-Related Lymphedema with Shear-Wave Elastography: The Holder-Optimized Elastography Method
2026-03-02 radiology and imaging 10.64898/2026.02.25.26344759
Top 1% (1.6%)
Show abstract

BackgroundBreast cancer-related lymphedema (BCRL) is a common complication following breast cancer treatment. While lymphoscintigraphy is considered the diagnostic gold standard, it is unsuitable for routine periodic monitoring or assessment of treatment efficacy. Shear wave elastography (SWE) offers a possible alternative, but traditional modes of operation limit its potential. Proposed SolutionsThe Holder-Optimized Elastography (HOE) method is introduced to eliminate pressure issues introduce...

18
Benchmarking Transfer Learning for Dense Breast Tissue Segmentation on Small Mammogram Datasets
2026-02-24 radiology and imaging 10.64898/2026.02.23.26346855
Top 1% (1.6%)
Show abstract

Dense breast tissue diminishes the sensitivity of mammographic screening and is a key cancer risk factor, which motivates accurate segmentation under scarce and expensive expert annotations in the medical imaging domain. Here, we benchmark the effect of backbone architecture, self-supervised pre-training (SSL), fine-tuning strategy, and loss design for dense-tissue segmentation on a small expert-labeled dataset (596 images) and an in-domain unlabeled corpus (20, 000 images), reflecting the lack ...

19
Ai-Driven Diagnosis Of Non-Alcoholic Fatty Liver Disease And Associated Comorbidities
2026-02-18 health informatics 10.64898/2026.02.12.26345169
Top 1% (1.6%)
Show abstract

Non-alcoholic fatty liver disease (NAFLD) is a globally prevalent hepatic condition caused by the buildup of fat in the liver. It is frequently associated with metabolic comorbidities such as hypertension, cardiovascular disease (CVD), and prediabetes. However, early detection remains challenging due to the asymptomatic progression, and existing primary diagnostic methods, such as imaging or liver biopsy, are often expensive and inaccessible in rural areas. This study proposes a two-stage, inter...

20
Knowledge augmented causal discovery through large language models and knowledge graphs: application in chronic low back pain
2026-02-18 neurology 10.64898/2026.02.13.26346255
Top 1% (1.6%)
Show abstract

Causal discovery algorithms are often leveraged for inferring causal relationships and recovering a causal model from data. However, causal discovery from data alone is limited by the structural constraints of the used dataset, the lack of causal logic, and the lack of external knowledge. Thus, data-driven causal discovery can only suggest possible causal relationships at best. To overcome these limitations, Large Language Models (LLMs) and knowledge systems, such as Retrieval-Augmented Generati...